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Abstract 

Histone modifications are important regulators of gene expression in all eukaryotes. In Plasmodium falciparum, these 
epigenetic marks regulate expression of genes involved in several aspects of host-parasite interactions, including antigenic 
variation. While the identities and genomic positions of many histone modifications have now been cataloged, how they are 
targeted to defined genomic regions remains poorly understood. For example, how variant antigen encoding loci (var) are 
targeted for deposition of unique histone marks is a mystery that continues to perplex the field. Here we describe the 
recruitment of an ortholog of the histone modifier SET2 to var genes through direct interactions with the C-terminal domain 
(CTD) of RNA polymerase II. In higher eukaryotes, SET2 is a histone methyltransferase recruited by RNA pol II during mRNA 
transcription; however, the ortholog in P. falciparum (PfSET2) has an atypical architecture and its role in regulating 
transcription is unknown. Here we show that PfSET2 binds to the unphosphorylated form of the CTD, a property 
inconsistent with its recruitment during mRNA synthesis. Further, we show that H3K36me3, the epigenetic mark deposited 
by PfSET2, is enriched at both active and silent var gene loci, providing additional evidence that its recruitment is not 
associated with mRNA production. Over-expression of a dominant negative form of PfSET2 designed to disrupt binding to 
RNA pol II induced rapid var gene expression switching, confirming both the importance of PfSET2 in var gene regulation 
and a role for RNA pol II in its recruitment. RNA pol II is known to transcribe non-coding RNAs from both active and silent var 
genes, providing a possible mechanism by which it could recruit PfSET2 to var loci. This work unifies previous reports of 
histone modifications, the production of ncRNAs, and the promoter activity of var introns into a mechanism that contributes 
to antigenic variation by malaria parasites. 
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Introduction 

The use of alternative histone modifications is an important 
mechanism utilized by eukaryotic cells to control many aspects of 
nuclear biology, including patterns of gene expression, DNA 
replication, chromosome segregation and overall genome organi- 
zation. In malaria parasites, histone modifications have been 
shown to play an important role in regulating many features of 
host-parasite interactions, including red blood cell invasion [1-3], 
expression of nutrient uptake channels [4] , and antigenic variation 
[5-8]. Systematic, genome-wide analysis of the presence and/or 
absence of various histone modifications have now been conduct- 
ed, providing a catalog of the epigenetic marks found in different 



regions of the Plasmodium falciparum genome [6,8]. In addition, 
analyses of the proteins encoded in the parasite's genome has 
enabled the identification of many of the enzymes responsible for 
deposition of these histone modifications [9], and knockouts of 
select number of these genes have now been conducted [10,11]. 
These studies have reinforced the importance of histone modifi- 
cations in Plasmodium biology, and identified these enzymes as 
potential targets for disease intervention. 

While the role histone modifications play in regulating gene 
expression is well established, an important and poorly understood 
aspect of this process is how various histone modifiers get directed 
to specific places within the genome. In some organisms, the 
production of noncoding RNAs (ncRNAs) has been implicated in 
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Author Summary 

Chemical modifications to histories, the proteins that serve 
as the primary units of chromatin, often determine 
whether specific genes are actively transcribed or con- 
densed into transcriptionally silent regions of the genome. 
In the malaria parasite Plasmodium falciparum, histone 
modifications have been shown to play a significant role in 
controlling gene expression involved in many aspects of 
their lifecycle, including the complex gene expression 
patterns associated with antigenic variation. The various 
histone modifications that are found within the parasite's 
genome have now been extensively cataloged, and the 
enzymes that are responsible for adding and removing 
them have been identified. However, how these enzymes 
are recruited to specific regions of the genome to 
coordinate gene expression is not understood. In this 
paper, we provide the first evidence for recruitment of a 
unique histone methyltransferase to specific regions of the 
genome through its tethering to RNA polymerase II. We 
find that disruption of this interaction results in major 
changes in expression patterns of genes involved in 
antigenic variation, demonstrating the importance of 
regulated recruitment of histone modifiers for parasite 
biology. 



the recruitment of various histone modifiers to specific loci [12]. 
Studies have also shown that once initial marks have been 
deposited, histone modifications can act combinatorially to enforce 
or reverse epigenetic marks, thus leading to "cross-talk" between 
modifications that result in the establishment of different gene 
expression states [13]. However the mechanism by which the 
assembly of the proper complement of modifications is initiated is 
an aspect of chromatin biology that remains in its infancy and 
virtually no work has been done in Plasmodium. 

One of the few well-characterized mechanisms of directed 
recruitment of a histone modifier is that of the SET2 (Su(var)3-9, 
Enhancer of Zeste, Trithorax 2) class of methyltransferases. These 
enzymes deposit a tri-methyl group on the lysine in the 36 th 
position of histone H3 (H3K36me3), which can subsequently be 
removed by its cognate demethylase JmjCl [14,15]. In higher 
eukaryotes, SET2 is recruited by RNA pol II during the act of 
transcribing mRNA, thereby tethering the protein to the 
elongating polymerase complex and thus effectively marking 
regions of the genome that have been transcribed [16,17]. This 
characteristic is mediated by the binding of SET2 directiy to the 
C-terminal domain (CTD) of the enzymatic subunit of the 
polymerase, Rpbl, but only when the CTD has been properly 
phosphorylated. Phosphorylation of the CTD occurs specifically 
when the polymerase is engaged in transcriptional elongation 
[18,19]. Thus SET2 recruitment coincides directly with mRNA 
synthesis and deposition of H3K36me3 is limited to specific 
regions of the genome that have been recently transcribed. This is 
thought to re-compact chromatin that was loosened by the passage 
of RNA pol II and thereby prevent inappropriate initiation of 
transcription from cryptic promoters that might exist within 
coding regions [20,21]. Thus SET2 serves a general function to 
ensure that transcription initiation by RNA pol II is properly 
regulated. 

Phylogenetic studies of histone methyltransferases in P. 
falciparum by Cui et al. identified a putative ortholog of SET2 
(called PfSET2) and biochemical work demonstrated that this 
protein possesses histone methyltransferase activity [9]. More 
recently, Jiang et al. generated a knockout of PfSET2 (also called 



PfSETvs) and showed that this resulted in a profound reduction of 
H3K36me3, providing strong evidence that PfSET2 deposits this 
mark on P. falciparum chromatin [22]. Interestingly, genome-wide 
analysis of the distribution of H3K36me3 found that this 
epigenetic mark was deposited almost exclusively at telomeres 
and at multi-copy gene families that undergo variant, mutually 
exclusive expression, for example the var gene family. Moreover, 
knockout of PfSET2 resulted in disruption of mutually exclusive 
expression while displaying no other detectable phenotype. Thus 
in P. falciparum, SET2 appears to be devoted to regulating mutually 
exclusive expression and antigenic variation. This is in sharp 
contrast to higher eukaryotes where SET2 deposits H3K36me3 
throughout the genome, within virtually all coding regions that are 
actively transcribed. Given these dramatic differences in 
H3K36me3 distribution and function, it is unclear how PfSET2 
specifically marks only very narrow regions of the parasite's 
genome and whether RNA pol II plays a role in its recruitment, as 
it does in higher eukaryotes. Understanding how PfSET2 gets 
targeted to genes that undergo mutually exclusive expression will 
shed light on a key step in the regulation of antigenic variation by 
P. falciparum. 

Examination of RNA pol II from malaria parasites previously 
identified an unusual expansion of the CTD in parasites that infect 
primates when compared to rodent malaria parasites [23], 
however the utility of this expansion was not illuminated. In 
addition, a recent report suggested that epigenetic mechanisms 
related to antigenic variation might be significantly different when 
rodent parasites are compared to those of primates [24]. Given 
these observations, we hypothesized that the expansion of the 
CTD in primate parasites might be involved in mechanisms of 
epigenetic regulation of antigenic variation, specifically var gene 
regulation. The multi-copy var gene family encodes the primary 
antigenic determinant, PfEMPl, expressed on the surface of 
infected red blood cells and has been shown to be transcriptionally 
regulated epigenetically [25]. Using genome comparisons, chro- 
matin immunoprecipitation, in vitro protein binding assays and the 
creation of dominant-negative transgenic lines, we confirmed that 
the histone methyltransferase PfSET2 is an important player in var 
gene regulation. More importantly, we identified a region of the 
protein that binds directly to RNA pol II in vitro and showed that 
disruption of this interaction in transgenic parasites drastically 
alters var gene expression patterns. Surprisingly, unlike the SET2 
orthologs of higher eukaryotes, PfSET2 binds to RNA pol II when 
it is unphosphorylated, a characteristic inconsistent with its 
recruitment during the transcription of mRNAs and providing a 
possible mechanism for how recruitment of PfSET2 is limited to 
very narrow regions of the genome. These data suggest a model in 
which PfSET2 is recruited to var gene loci by RNA pol II during 
the production of previously described oar-specific ncRNAs, and 
that the resulting histone mark plays a role in coordinating 
mutually exclusive expression and var expression switching. 

Materials and Methods 

Antibody production 

Unmethylated, mono, di and trimethylated peptides that 
include and flank the H3K36 residue were synthesized and 
conjugated at Rockefeller University using the following epitope: 
RKSAPISAGI-K(Me0,l,2,3)-KPHRYRPGT. Two rabbits were 
immunized with the trimethylated peptides and rabbit polyclonal 
antibodies were obtained via a 77-day protocol (Covance). To 
increase specificity, the anti-H3K36me3 serum was pre-absorbed 
by incubating with unmethylated, monomethylated, and dimethy- 
lated peptides. 1 ml of a 1:10 dilution of rabbit serum was 
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incubated with 1 5 (U.1 of a 1 mg/ml solution of each peptide for 
30 min at 37 degrees, rocking. Specificity of the final antibody 
preparation was determined using peptide competition assays and 
verified through assaying detection of 15 ug of purified unmethy- 
lated, monomethylated, dimethylated and trimethylated peptides 
through slot-blot assays. 

Immunodetection of H3K36me3 in P. falciparum 
chromatin 

A total of 2 x 10 9 cells of the NF54 line of P. falciparum were lysed 
in RIPA buffer (Sigma; 50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 
1% NP-40, 0.25% sodiumdeoxycholate, 0.1% SDS), plus mam- 
malian protease inhibitor cocktail (Sigma) supplemented with 
200 mg ml-1 PMSF and 4 milligrams ml-1 pepstatin. 1 to 15 
micrograms of lysates were separated by SDS-PAGE using a 15% 
acrylamide gel and transferred to a PVDF membrane. Membranes 
were blocked overnight with 5% milk. Primary antibodies were 
detected with horseradish-peroxidase-conjugated sheep anti-rabbit 
antibodies. Primary antibodies were either the anti-H3K36me3 
described above or commercially available antibodies against H3- 
core. 

Extraction and purification of histones from P. falciparum 

P. falciparum histones were extracted and purified by a modified 
protocol as previously described by Schecter, et al. [26]. Briefly, 
infected NF54 cells were lysed by saponin and released parasites 
were washed twice in 1 x PBS. The parasite cell pellet was 
resuspended in 1 mL of ice cold water to induce lysis and 
incubated for 30 min on a rotator at 4°C . The nuclei was pelleted 
by spinning at 10,000 g for 10 min at 4°C. The supernatant was 
discarded and the pellet was resuspended in 0.4 N H2S04 and 
incubated overnight on a rotator at 4°G. The sample was then 
spun down for 20 min at 14,000 g. The supernatant was dialyzed 
against H 2 0 for 2 h to overnight. The sample was retrieved and 
lyophilized in a SpeedVac overnight. The powder was diluted in 
-100 uL H 2 0. SDS-PAGE/Western was performed to detect the 
H3K36me3 mark. 

Chromatin immunoprecipitation 

Chromatin Immunoprecipitation was performed as previously 
described [5,27] using samples prepared from synchronized ring 
stage parasites from recently cloned lines with established var 
expression profiles [28]. Pre-absorbed anti-H3K36me3 antibodies 
(see above), anti-H3 core histone (positive control) or no antibody 
(negative control) were used for the immunoprecipitation. Isolated 
DNA was then analyzed by quantitative RT-PCR. Graphs display 
the ratio of the relative quantity of bound DNA divided by the 
total input (bound+unbound), normalized to a control gene 
(PfCTRP). 

Expression and purification of P. falciparum and 
P. berghei recombinant CTD of Rpb1 

Genomic DNA from P. falciparum and P. berghei were used as 
template for PCR amplification of the R2 and R3 regions (R2R3) 
of the C-terminal domain (CTD) of Rpbl. The sequence encoding 
a 3 x Flag-tag was incorporated into the reverse primer in each 
reaction. The amplified products were cloned into the pSUMO 
expression vector, which includes an N-terminal Hise tag. 
Recombinant proteins were expressed in BL21(DE3) E. colt cells, 
which were subsequently purified using Nickel-nitrilotriacetic acid 
(Ni-NTA) beads (Qiagen). Proteins were concentrated and stored 
in 50 mM Tris, pH 8.0, 0.1 M NaCl and 10% glycerol. Primers 
used for PCR amplification are described in Table SI. 



Expression of recombinant PfSET2 fragments and the 
SET2-Rpb1 interacting (ySRI) domain from yeast 

The DNA sequences encoding five different PfSET2 frag- 
ments (PfSET2 (1 - 3 59) ! PfSET2 (35 i-828), PfSET2 ( 

1060-1535); 

PfSET2(i54o-2080) an d PfSET2(2264-2548)) were amplified by PCR 
using P. falciparum NF54 genomic DNA as template. The DNA 
sequence encoding the ySRI was amplified by PCR using 
Sacharomyces cerevisiae genomic DNA. The sequence encoding an 
HA epitope tag was incorporated into the reverse primers to create 
HA-tagged versions of each fragment. The sequence of the PCR 
primers is provided in Table SI. The PCR products were cloned 
into pSUMO and protein expression induced in BL21(DE3)RIL 
E. coli cells as described above. Cultures expressing the recombi- 
nant proteins were harvested and lysed as described above, with 
the exception that EDTA and MgCl 2 were added to the lysis 
buffer to final concentrations of 0.5 mM and 1.0 mM respectively. 
Cleared lysates were used for co-immunoprecipitation assays and 
expression of truncates were monitored by SDS-PAGE and 
Western blot. 

In vitro phosphorylation of the CTD of Rpb1 

Purified Flag-tagged R2R3 portions of the CTD of Rpbl from 
P. falciparum and P. berghei were phosphorylated using Cdc2 kinase 
(NEB). Phosphorylation of the CTD was monitored by SDS- 
PAGE and Western blot. 

Co-lmmunoprecipitation assays 

Lysates from bacterial cultures expressing HA-tagged PfSET2 
fragments and the ySRI were incubated with purified, recombi- 
nant Rpbl CTD in both phosphorylated and unphosphorylated 
forms in 3% BSA at 4°C for 16 h. Reactions were precleared on 
control agarose resin (Thermoscientific) and loaded onto Protein A 
agarose beads (Millipore) for two hours. Bead were washed 
extensively three times with 50 mM Tris, pH 8.0, 0.5 M NaCl, 
0.5% Tween-20 and 10% glycerol. Polyclonal anti-HA antibody 
(Rabbit; Sigma) was used to immunoprecipitate P1SET2 and any 
interacting proteins. Immunoprecipitated proteins were separated 
by SDS-PAGE and the CTD of Rpbl was detected by Western 
blotting using anti-Flag antibody. 

SDS-PAGE and western blotting 

Proteins were separated on 4-20% Tris-HCl acrylamide gels 
and analyzed by Western blot. Antibodies were detected using the 
SuperSignal West Pico Chemiluminescent substrate reagents from 
Thermoscientific. Monoclonal mouse anti-HA and anti-His 
antibodies (Sigma) were used at dilutions of 1:2000 and 1:6250 
respectively. Monoclonal anti-Flag (Sigma) was used at a dilution 
of 1:4000. All antibodies were detected using anti-mouse IgG 
(whole molecule) peroxidase, produced in rabbit (Sigma) at a 
dilution of 1:100,000. 

Creation of dominant-negative transgenic parasite lines 

The dominant negative experiments utilized clonally stable 
NF54 bulk cultures previously generated by Frank et al. [28] . Two 
of NF54 subclones called A3 and C3 were cultivated and 
transfected as described by Dzikowski, et al. [29]. Expression 
constructs were created using the PLN-ENRGFP plasmid [30]. 
For the dominant-negative version of PfSET2, the sequence 
encoding the N-terminal 359 amino acids was fused to a C- 
terminal HA tag. A control construct included the sequence 
encoding Firefly luciferase. Both dominant-negative and control 
parasite lines were initially selected by culturing in the presence of 
2 ug/mL blasticidin S HCL for 42 days. Portions of each culture 
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were then transferred to new flasks and grown in the presence of 
10 ug/mL blasticidin for 49 days. The 2 ug/ML blasticidin 
cultures were maintained throughout the duration of the 
experiment. 

RNA extraction and cDNA synthesis 

RNA was extracted as previously described [29]. Briefly, RNA 
was TRiZol (invitrogen) extracted from synchronized late ring 
stage parasites grown at both 2 and 10 ug/mL blasticidin and 
purified on PureLink (invitrogen) columns following manufactur- 
er's protocol. Purified RNA was treated with Deoxyribonuclease I 
(DNAse I) (Invitrogen). cDNA was synthesized from approximate- 
ly 800 ng of RNA in a reaction that included Superscript II RNase 
H reverse transcriptase (invitrogen) with random primers (Invitro- 
gen) as described by the manufacturer. 

Quantitative RT-PCR 

For chromatin immunoprecipitation studies, forward and 
reverse primer pairs with complementary sequences within the 
var gene PF3D7-0421 100 were used for qRT-PCR. The standard 
curve method was used to quantitate mRNA levels (ABI user 
manual). All reactions were compared to control genes encoding 
acyl-tRNA synthetase, actin and a P. falciparum circumsporozoite 
protein. All reactions were performed using ITAQ_ SYBR 
supermix (Bio-Rad) in 10 uL reactions using an ABI Prism 7900 
HT real time PCR machine. For gene expression studies, cDNA 
was evaluated for genomic DNA contamination prior to assessing 
var gene expression. To detect transcription of all var genes, we 
employed a real-time PCR set developed by Salanti et al. [3 1] and 
modified by Dzikowski [32] and Frank [28] . To detect Pfset2 or 
specifically the SRIR domain included in dominant negative 
constructs, specific primers were designed and are shown in Table 
S2. ACT values for each individual primer pair were obtained by 
subtracting the CT value from the CT value of the control seryl- 
tRNA synthetase (User bulletin 2, Applied Biosystems, http:// 
www.appliedbiosystems.com). The formula 2 ACT was used to 
convert the ACT values to relative copy numbers. All reactions 
were performed in duplicate and averaged in Microsoft Excel. 
Transcriptional profiles of all the individual var genes are presented 
as pie graphs prepared as described by Frank et al. [28]. The most 
highly expressed var gene was defined as the dominant gene. 

Results 

Identification of a histone methyltransferase/ 
demethylase pair specific to primate malaria parasites 

Previous examination of the CTD of RNA pol II from 
numerous primate and non-primate Plasmodium species found that 
the CTD is uniquely expanded in parasites that infect primates 
[23]. In higher eukaryotes, this domain contains repeats which 
actively recruit various proteins to the RNA pol II enzyme 
complex. Of particular interest for our studies, when appropriately 
phosphorylated these repeats directiy bind histone modifiers, 
recruiting them to the actively transcribing polymerase complex 
and thus marking recendy transcribed regions of the genome [33]. 
Together these observations led us to hypothesize that primate 
parasites might possess one or more unique histone modifiers 
when compared to rodent parasites, and that such modifiers would 
directly interact with the expanded CTD of RNA pol II. 

As a first step to test this hypothesis, we compared the genome 
sequences of three primate parasites (P. falciparum, P. vivax and P. 
knowlesi) with the genomes of three rodent parasites (P. berghei, P. 
yoellii, and P. chabaudi). Specifically, we used a collection of 202 
transcription factors and histone modifiers identified in P. 



falciparum by Bischoff et al. [34] to perform a comparative screen 
of proteins encoded within the genomes of the other five 
Plasmodium species. Three protein encoding genes were identified 
that were present within the genomes of P. falciparum, P. knowlesi 
and P. vivax but missing in all three rodent lineages. These 
included the histone methyltransferase PfSet2 (PF3D7_1322100) 
and its complementary demethylase PffmjCl (PF3D7J3809900) 
(Figure 1A). These two modifiers are predicted to add and remove 
methyl groups from the lysine in the 36 th position of histone H3 
(H3K36) respectively, thus suggesting they represent a comple- 
mentary pair of histone modifiers specific to primate parasites. Of 
particular interest for this study, SET2 orthologs from higher 
eukaryotes have been shown to directly bind to the CTD of RNA 
pol II, adding confidence to our hypothesis that the presence of 
these histone modifiers might be directiy related to the expanded 
CTD found in primate parasites. The third gene found to be 
unique to primate parasites, a predicted zinc-finger transcription 
factor expressed in gametocytes (PF3D7_1 134600), will not be 
described further here. 

To verify that orthologs of Set2 and JmjCl are in fact absent 
from the rodent parasite genomes and not simply mis-annotated or 
not identified due to low levels of sequence similarity, we examined 
the genome assemblies for syntenic open reading frames that could 
represent the missing genes. All six genomes displayed near perfect 
synteny in the regions surrounding both PfSet2 and PJJmjCl, 
however these two genes are entirely missing from the genomes of 
all three rodent parasites (Figure IB). No predicted pseudogenes or 
other similar genome features were found where these genes were 
predicted to reside, suggesting that insertion/ deletion events define 
the primate vs rodent lineages at these positions in the genome. As 
final confirmation that the genome assemblies are correct, PCR 
primers were designed to the regions of the P. berghei genome 
flanking the syntenic positions where Set2 and JmjCl were 
predicted to exist. PCR amplifications yielded the predicted 
product in both cases and confirmed that neither Set2 nor JmjCl 
exist at these positions in the genome (not shown). Degenerate 
PCR primers to Set2 and JmjCl also failed to amplify products 
using P. berghei genomic DNA as template, further suggesting that 
these two genes are likely not present within the rodent parasite 
genomes. 

Detection of H3K36me3 in P. falciparum chromatin 

Methyltransferases of the SET2 class are predicted to transfer 
three methyl groups to histone H3 at the lysine in the 36 th position 
(H3K36me3) [14]. If PfSET2 possesses this activity, H3K36me3 
should be found within the chromatin of P. falciparum. Conflicting 
reports exist in the literature regarding the presence of this 
particular histone modification. Issar and colleagues were unable 
to detect H3K36me3 using a commercially available antibody 
preparation generated against yeast H3K36me3 [35], and Trelle et 
al. also did not detect this modification in a proteomic analysis of 
chromatin extracted from cultured malaria parasites [36]. In 
contrast, using a different antibody preparation, Cui et al. detected 
H3K36me3 within P. falciparum chromatin and also detected the 
appropriate methyltransferase activity using recombinant PfSET2 
on H3 core [37]. More recendy, Jiang et al. detected H3K36me3 
using an alternative source of commercially available antibodies, 
and found that the level of this modification was greatly reduced 
with PfSET2 was knocked out, thus provided more convincing 
data for the presence of this histone modification in P. falciparum 
and its deposition by PfSET2 [22]. Nevertheless, P. falciparum 
histone H3 has several amino acid polymorphisms close to K36 
when compared to higher eukaryotes, making the use of 
commercially available antibodies generated against H3 peptides 



PLOS Pathogens | www.plospathogens.org 



4 



January 2014 | Volume 10 | Issue 1 | e1 003854 



PfSET2 Recruitment by RNA Pol II 





PHD > 2548 a.a. 



PfJmjCI ( d > => 1259 a.a. 



Chromosome 13 

B 



R falciparum 



P. vivax 



P. knowlesi 



PF3D7J321900 PF3D7J 322000 PF3D7_1322100 (SET2) PF3D7_1 322200 
<— ► ► 



P. berghei 
P. chabaudi 

P.yoellii 



P. falciparum 



P. vivax 



P. knowlesi 



Chromosome 8 

PF3D7_0809700 PF3D7_0809800 PF3D7_0809900 (JmjCI ) PF3D7_0810000 
< < < 



P. berghei 
P. chabaudi 
P.yoellii 



Figure 1. Two histone modifiers unique to primate parasites. A comparative screen of the genomes of primate and rodent malaria parasites 
identified two histone modifiers unique to P. falciparum, P. vivax and P. knowlesi, SET2 and JmjCI. (A) Domain searches of the amino acid sequences 
using SMART (Simple Modular Architecture Research Tool, http://smart.embl-heidelberg.de) identified several domains typical of chromatin 
modifying enzymes, including several PHD domains and a SET2-like methyltransferase domain in PfSET2 and a predicted JmjC type demethylase in 
PfJmjCI. (B) Syntenic regions of chromosomes 13 and 8 of three primate parasites and three rodent parasites. Extensive synteny is observed 
throughout these chromosomal regions, with the exception of SET2 and JmjCI, which are missing from the genomes of the rodent parasites. The 
flanking genes are highly conserved. Figure adapted from Plasmodb.org. 
doi:10.1371/journal.ppat.1003854.g001 



from heterologous organisms potentially problematic, and provid- 
ing a possible explanation for the disparate results. Thus 
independent confirmation of the presence or absence of 
H3K36me3 within chromatin from P. falciparum was desirable. 



Given the identification of PfSET2 as a potentially unique 
histone modifier in primate parasites, a first step was to definitively 
determine if the predicted histone modification was actually 
present within the chromatin of primate parasites. Due to the 
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conflicting results obtained using antibodies generated against 
H3K36me3 peptides from other organisms, we generated custom 
antibodies using synthetic peptides of P. falciparum histone H3. To 
determine the ability of these antibodies to distinguish between 
H3K36, H3K36me, H3K36me2 and H3K36me3, we utilized 
slot-blots of synthetic peptides in all four methylated states. Our 
custom antibodies only recognized H3K36me3 (Figure 2A), thus 
providing us with a reagent to determine if this modification is 
present in chromatin extracted from cultured P. falciparum. 
Western blots of total protein extracts and of acid extracted 
histones obtained from cultured parasite detected a band of 
appropriate size that co-migrated with a band detected by 
antibodies against core H3 (Figure 2B). Immunofluorescence 
assays of fixed parasites also detected H3K36m3 within parasite 
nuclei (Figure SI). These data, combined with the H3 methyl- 
transferase activity previously detected in vitro by Cui et al. [9] as 
well as the recent knockout studies [22] suggest that PASET2 is in 
fact a bona fide H3K36 methyltransferase. 

Unique architecture of PfSET2 and binding to RNA pol II 

Investigations of SET2 orthologs from both yeast and humans 
have identified a conserved architecture in which the methyl- 
transferase domain is found near the N-terminus of the protein 
[16,17] (Figure 2C). The RNA pol II CTD binding region, 
referred to as the SET2-Rpbl -interacting (SRI) domain, is located 
within the C-terminal region of the protein. The SET2 protein 
from P. falciparum has a very different architecture, however [38]. 
The putative methyltransferase domain is found near the C- 
terminus, and the remainder of the protein does not possess any 
regions with significant similarity to the SRI domains identified in 
the human or yeast SET2 proteins (Figure 2C). In addition, 
PfSET2 possesses several PHD domains (domains known to be 
involved in protein-protein interactions and epigenetic regulation) 
that are not found in the SET2 proteins of either yeast or 
mammals (Figure 2C). These differences prevented the computa- 
tional identification of a potential SRI-like domain within PfSET2, 
thus if or how PfSET2 is recruited by RNA pol II remained 
unknown, and we could not rule out the possibility that PfSET2 
does not bind to the RNA pol II complex at all but rather is 
recruited to specific regions of the genome via an alternative 
mechanism. Similarly, the recent finding that the H3K36me3 
mark is not found throughout the genome where RNA pol II 
actively transcribes mRNA further suggested that PfSET2 is not 
recruited in a similar manner to what has been described in higher 
eukaryotes. 

Our original identification of PfSET2 was based on its 
hypothetical interaction with the extended CTD of Rpbl from 
primate malaria parasites. Therefore, we attempted to determine if 
such an interaction actually exists. To determine if PfSET2 can 
interact with the CTD of Rpbl from P. falciparum, we adopted the 
in vitro protein binding assay developed to identify the SRI in both 
human and yeast SET2 [16,17]. In these assays, recombinantly 
produced, epitope tagged portions of SET2 were combined with 
the CTD that was tagged with an alternative epitope. After 
incubation in the appropriate binding buffer, co-immunoprecip- 
itation (co-IP) assays were performed to identify the regions of 
PfSET2 that stably interact with the CTD. Five different 
fragments covering the entire length of PfSET2 (with the 
exception of the two tandemly arranged PHD domains and the 
methyltransferase region) (Figure 2C) were produced recombi- 
nantly in E. coli and incubated with the R2/R3 regions of RNA 
pol II from P. falciparum (Figure S2) [23]. These assays found that 
the first 359 amino acids of the N-terminus of PfSET2 bound 
robustly to the CTD of PfRpbl, whereas no other region of the 



protein displayed any detectable binding activity (Figure 2D). We 
hereafter refer to this region of PfSET2 as the SET2-Rpbl- 
interacting region (PfSRIR). The SRI domain from yeast SET2 
does not require the entire CTD for efficient binding and can also 
interact with a substantially truncated portion of the CTD repeat 
region [17], as observed in both in vitro assays and crystal structure 
analysis [39] . We similarly found that the PfSRIR does not require 
the entire CTD for binding and can also interact with a 
substantially shorter CTD from P. berghei (Figure 2E). These 
experiments show that despite the lack of any detectable similarity 
in amino acid sequence or domain architecture, PfSET2 does 
indeed bind to the CTD of Rpbl, suggesting that it is similarly 
recruited to specific regions of the genome through the action of 
RNA pol II. 

Binding of PfSET2 to the CTD of RNA Pol II is disrupted by 
phosphorylation 

Binding of SET2 to Rpbl in humans and yeast has been shown 
to be dependent on phosphorylation of the serine residues found 
within the heptad repeats of the CTD [16,17]. All higher 
eukaryotic CTDs consist of a tandem array of the conserved 
heptad YSPTSPS/K. The human CTD contains 52 heptads, 
whereas the CTD of the yeast S. cerevisiae has 26 [40] . During the 
act of transcription, the CTD undergoes changes in the 
phosphorylation status of the serines at positions 2 and 5 of the 
heptads, thereby altering the conformation of this domain and 
changing which proteins are recruited by the CTD to the protein 
complex [41]. Phosphorylation at positions 2 and 5 of the heptads 
occurs when the polymerase is engaged in transcriptional 
elongation of mRNAs, and this specific modification has been 
shown to recruit SET2 in humans and yeast [18,19]. This property 
directly ties SET2 recruitment to mRNA elongation, thus in the 
genomes of these organisms, H3K36me3 is found within the body 
of most transcribed genes. 

Given that Rpbl of P. falciparum possesses very similar heptad 
repeats as found in model eukaryotes, including the serines at 
positions 2 and 5 that serve as phosphorylation sites (Figure S2), 
we anticipated that PfSET2 would similarly require phosphory- 
lation for binding to the CTD. However, our initial experiments 
detected robust binding of PfSET2 to the unphosphorylated CTD 
(Figure 2D), suggesting that this interaction might be fundamen- 
tally different in malaria parasites. To further determine if 
phosphorylation of the CTD influences binding by PfSET2, we 
exposed recombinant CTD to Cdc2 kinase, a kinase known to 
phosphorylate the serine residues within CTD heptads, then 
repeated our co-IP assay. Surprisingly, phosphorylation of the 
CTD completely disrupted PfSET2 binding (Figure 3A and B). 

The affinity of PfSET2 for an unphosphorylated CTD and 
disruption of binding by phosphorylation is fundamentally 
opposed to the current model of SET2 recruitment as described 
for yeast and mammals. If true, this might suggest a very different 
role for PSET2 in parasite biology. Given the potential 
significance of this finding, we chose to more rigorously explore 
this interaction by directly comparing binding affinities of the yeast 
SRI (ySRI) to that of the PfSRIR. Recombinant ySRI and PfSRIR 
were incubated with both the unphosphorylated and phosphor- 
ylated versions of the P. falciparum CTD and co-IP assays were 
performed side-by-side. In agreement with previous reports 
[17,18], the ySRI displayed a robust affinity for the phosphory- 
lated CTD, with much weaker binding to the unphosphorylated 
form also evident (Figure 3C). This minor binding to the 
unphosphorylated Plasmodium CTD is somewhat different from 
what was observed for the yeast CTD and might reflect subtle 
differences in the heptad repeat structure between the two 
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Figure 2. Detection of H3K36me3 in chromatin from P. falciparum and identification of the Rpbl binding region of PfSET2. (A) Slot 
blots of the synthetic peptide sequence RKSAPISAGIKKPHRYRPGT. This sequence includes K36 of histone H3 of P. falciparum (underlined). Lane 1, 
unmethylated peptide. Lane 2, peptide synthesized with a mono-methyl group at the K36 position. Lane 3, peptide synthesized with a di-methyl 
group at the K36 position. Lane 4, peptide synthesized with a tri-methyl group at K36 position. All lanes included 15 |ig of purified peptide. The dot 
blot was probed with antibodies against the tri-methylated form of the peptide and demonstrated that the antibodies are specific to the tri- 
methylated form. (B) Western blot of P. falciparum protein extracts taken from early trophozoites. Lane 1, blot of total parasite proteins probed with 
anti-H3K36me3 antibodies. Lane 2, blot of total parasite proteins probed with anti-H3-core antibodies. Both antibodies recognize bands migrating at 
— 18 kDa. Lane 3, blot of acid extracted histones obtained from cultured parasites and probed with anti-H3K36me3 antibodies. (C) Comparison of the 
architecture of SET2 orthologs from 5. cerevisiae, H. sapien and P. falciparum. Note that the SET2-Rpb1 -Interacting domain (SRI) is found at the C- 
terminal end of both the yeast and human SET2 proteins. The protein domains shown in the figure are displayed using the format derived from 
SMART (Simple Modular Architecture Research Tool, http://smart.embl-heidelberg.de). Below PfSET2 are displayed the portions produced as HA- 
tagged, recombinant proteins for use in co-immunoprecipitation assays. The numbers below each fragment represent the amino acid positions 
within the full-length protein. The tandem PHD domains (amino acids 829-1 060) were not included because the DNA encoding the this region of the 
protein was unstable in the expression vector. The SET domain was also not included because the structure and function of this domain is conserved 
and was previously shown not to bind the CTD in yeast and human systems. (D) Co-immunoprecipitation assays used to identify the region of PfSET2 
that interacts with the CTD of Rpb1. Flag-tagged fragments of the CTD of Rpb1 shown in E were incubated with bacterial lysates containing the 
truncates of PfSET2 shown in C. Immunoprecipitation was performed using anti-HA beads followed by SDS-PAGE and Western blotting using anti- 
Flag antibodies. Only the most N-terminal portion of PfSET2 (amino acids 1-359) was found to interact with the CTD of Rpb1. The control reaction 
consisted of an HA-tagged PHIST protein similarly incubated with the CTD of Rpb1. (E) The N-terminal region of PfSET2 also binds to Rpb1 regions 
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with fewer numbers of repeats. Co-immunoprecipitation assay performed 
berghei (right lane). The P. berghei CTD has only 8 heptad repeats. 
doi:1 0.1 371 /journal.ppat.1 003854.g002 

organisms [23]. In stark contrast, the PfSRIR only bound the 
unphosphorylated CTD, thus displaying the opposite affinity and 
much greater specificity than the ySRI. These results were 
unexpected and suggest that if Rpbl of P. falciparum undergoes 
phosophorylation of its CTD heptads similarly to all other studied 
eukaryotes, PfSET2 might not interact with the RNA pol II 
complex when it is actively transcribing mRNA, but rather while it 
is engaged in an alternative function. 

H3K36me3 marks both active and silent var genes 

If binding of PfSET2 to the RNA pol II complex requires 
phosphorylation of the serines of the CTD of Rpbl as has been 
shown in higher eukaryotes, we anticipated that the H3K36me3 
mark would be found at actively transcribed genes within the 
parasite's genome, including most or all housekeeping genes. 
However, our in vitro data suggested that PfSET2 binds to RNA 
pol II when the CTD is unphosphorylated, suggesting that it might 
be recruited when the polymerase complex is engaged in a 
function other than transcribing an mRNA. To investigate these 
potentially contradictory models, we performed chromatin immu- 
noprecipitation (ChIP) experiments to examine the deposition of 
H3K36me3 in live parasites utilizing the custom antibodies 
generated against H3K36me3. Following standard ChIP proce- 
dures, we isolated chromatin from cultured asexual parasites and 
performed quantitative real-time PCR on the immunoprecipitated 
fractions. We examined two "housekeeping" genes that are known 
to be actively transcribed into mRNA as well as a stage specific 
gene (csp) that is transcriptionally silent during asexual develop- 
ment [42,43]. In addition, we investigated the deposition of 
H3K36me3 at var genes. These genes serve as a particularly useful 
model since they are subject to both silencing and activation 
through epigenetic mechanisms [25]. Thus, it is possible to 
examine individual var genes when they are "on" and actively 
transcribing mRNA or when they are "off" and no mRNA 
synthesis can be detected. We performed parallel ChIP experi- 
ments using different isogenic parasite lines that had been grown 
for a limited number of generations after cloning and that 
expressed different subsets of the var gene family. This enabled us 
to examine individual var genes in both their active and silent 
states. 

The ChIP data found very little or no H3K36me3 at the two 
actively transcribed housekeeping genes or at the transcriptionally 
silent csp locus (Figures 3D and S7), suggesting that this mark is not 
simply associated with recently transcribed regions of the genome. 
However the mark was easily detectable within the coding regions 
of var genes. More importantly, we found this mark to be equally 
present within the coding regions of var genes regardless of 
whether they were active or silent. Similar results were recendy 
reported by Jiang et al. using a genome-wide analysis of 
H3K36me3 deposition [22]. The absence of H3K36me3 at 
actively transcribed housekeeping genes and its presence at both 
silent and active var genes implies that, unlike in yeast and 
mammals, H3K36me3 is not a universal mark for actively 
transcribed chromatin and further suggests that PfSET2 is not 
recruited by RNA pol II when it is actively synthesizing mRNA. 
The disruption of the in vitro PfSET2/Rpbl interaction by 
phosphorylation similarly suggests an interaction unrelated to 
mRNA synthesis. However, RNA pol II has been shown to 
transcribe ncRNAs from var introns when the genes are both 
actively expressed and when they are silent (Figure 3E) [27,44], 



as in D, using either the CTD of Rpbl from P. falciparum (left lane) or P. 



thus providing an alternative opportunity for the recruitment of 
PfSET2 to var genes that is independent of mRNA synthesis. 
ncRNA production at var loci therefore provides a possible 
alternative opportunity for RNA pol II to recruit PfSET2, and 
would be consistent with the presence of this mark at both active 
and silent var loci. 

Over-expression of a dominant-negative version of 
PfSET2 induces profound changes in var gene expression 

The multi-copy var gene family encodes the variant surface 
antigen PfEMPl [45 — 47] . The family includes approximately 60 
var genes expressed mutually exclusively, with the transcriptionally 
active copy determining the antigenic and virulence phenotype of 
the infected RBC. Once activated, a var gene tends to remain 
active for many replicative cycles and switching is detectable only 
after lengthy time periods in cultured parasites [28,48,49]. This 
reflects the low transcriptional switching rate that is typical of this 
gene family and that is thought to be required for the efficient 
antigenic variation needed to maintain a long-term infection 
within the human host. The detection of H3K36me3 at var loci 
suggested that PfSET2 might play a role in var gene transcriptional 
regulation, and parasites in which PfSET2 was knocked out 
displayed a complete loss of mutually exclusive var gene expression, 
thus confirming the importance of PfSET2 in var gene regulation 
[22]. However, how PfSET2 is specifically recruited to var gene 
loci and a possible role for RNA pol II in its recruitment remained 
unresolved. If PfSET2 is recruited to var genes through its 
interactions with the RNA pol II complex as theorized here, 
disruption of this interaction could lead to changes in var gene 
expression. We aimed to test this model by disrupting PfSET2/ 
RNA pol II interactions in transgenic parasites. 

In Figure 2, we identified the PfSRIR by its ability to bind to the 
CTD of Rpbl in vitro and theorized that this binding activity 
mediates its recruitment to the RNA pol II complex. This region is 
found within the most N-terminal 359 amino acids and is relatively 
distant from other portions of the protein that contain recogniz- 
able motifs (Figure 4A). If this portion of the protein is sufficient to 
mediate CTD binding as observed in the in vitro assay, a truncated 
version of the protein expressed in transgenic parasites containing 
only this 359 amino acids could potentially compete with 
endogenous PfSET2 for binding to Rbpl. Since the truncated 
version of the protein is enzymatically dead, this could result in a 
"dominant-negative" phenotype, potentially altering or disrupting 
var gene regulation. While the phenotype of a dominant negative is 
typically less pronounced than a knockout, this method has the 
advantage of enabling the investigator to probe the importance of 
specific regions or domains of the protein being studied, thus 
revealing mechanistic insights not apparent with a knockout. In 
this case, we were interested in testing the hypothesis that over- 
expression of the PfSRIR alone will result in an alteration in var 
gene expression, presumably by affecting the ability of RNA pol II 
to recruit PfSET2. We expected the phenotype to be less severe 
than what was observed for the PfSET2 knockout, particularly 
since the endogenous PfSET2 gene remained unaltered and fully 
active, with only its recruitment by the RNA pol II complex 
affected. 

To over-express the PfSRIR of PfSET2, we cloned a fragment 
of the gene encoding the first 359 amino acids into the vector 
pLN-ENRGFP [30]. This vector expresses the gene of interest 
under control of the calmodulin promoter while also expressing 
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Figure 3. Phosphorylation dependence of PfSET2 binding to the CTD and enrichment of H3K36me3 at var loci. (A) Co- 

immunoprecipitation assays showing that phosphorylation of the CTD of Rpb1 eliminates binding by PfSET2. The two left lanes show the flag-tagged 
P. falciparum Rpb1 in its phosphorylated (ll 0 ) or unphosphorylated (ll a ) states. The right two lanes show blots of co-immunoprecipitation assays after 
incubation of bacterial lysates containing the HA tagged, N-terminal region of PfSET2 (PfSET2 n _3 59 ) with phosphorylated and unphosphorylated CTD. 
Phosphorylation of the CTD disrupts binding by PfSET2. (B) Co-immunoprecipitation assays as in A performed with the CTD of Rpb1 from P. berghei. 
(C) Direct comparison of the CTD binding affinities of SET2 from S. cerevisiae and P. falciparum. The SRI from S. cerevisiae (left two lanes) interacts 
preferentially with the phosphorylated form of the P. falciparum CTD (left) but also displays lesser binding to the unphosphorylated form (right). The 
right two lanes show the same assay with the P. falciparum SRIR region (amino acids 1-359). The SRIR displays specific binding to the 
unphosphorylated CTD (right) and little or no binding to the phosphorylated form (left).(D) Chromatin immunoprecipitation assays from ring stage 
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parasites using anti-H3K36me3 antibodies. The assays show enrichment of the H3K36me3 mark in both exons of var gene PF3D7_0421 100. Lanes 1 
and 2 represent control genes encoding seryl t-RNA synthetase and actin, respectively. Lane 3 represents the gene for circumsporozoite protein. The 
remaining lanes represent regions within the coding portion of both exons of the var gene PF3D7_0421 100. Black bars show results from chromatin 
extracted from the C3 line of NF54, in which PF3D7_0421 100 is the actively expressed var gene. The gray bars show chromatin extracted from the A3 
line of NF54 in which this var gene is transcriptionally silent. The bars display the mean +/- standard deviation of relative amounts of bound DNA 
(see methods) from four independent experiments. Similar experiments using no-specific antibodies or control antibodies against core histone H3 
showed no specific enrichment at PF3D7_0421 1 00 (not shown). The graph shown represents the signal for each gene normalized to the control gene 
ctrp. The data is shown as % input without normalization in Figure S7. (E) Schematic representation of a typical var gene. Note that each gene has 
three promoters, one upstream of the coding region and responsible for transcribing an mRNA from the single, active var gene and two within the 
intron transcribing ncRNAs in opposite directions. The intron promoter that transcribes the antisense ncRNA is active during rings stages and is only 
active in genes that are also transcribing mRNA. The intron promoter transcribing ncRNA in the sense direction leads to expression of "sterile", 
ncRNAs during late trophozoites and schizonts and is active in most or all members of the var gene family. 
doi:1 0.1 371 /journal.ppat.1 003854.g003 



the blasticidin S deaminase (bsd) gene to enable selection of transfected 
parasites. The use of bsd as a selectable marker enables control 
of transgene expression levels by manipulating drug concentra- 
tions in the culture media [50]. At low concentrations of blasti- 
cidin (2 |J.g/ml), a low but detectable level of the transgene 
transcript was observed while at high concentrations of blasticidin 
(10 ug/ ml), expression levels increased by ~ 10-15 fold (Figure 4B). 
If the expression of the PfSRIR has a dominant-negative effect, 
this should be most easily detected at high levels of expression 
since it is predicted to work in a competitive fashion. We also over- 
expressed Firefly luciferase as a negative control. Parasites over- 
expressing the PfSRIR construct were viable and displayed no 
obvious growth defect. To assay for effects on var gene expression, 
we transfected two parasite lines (subclones of NF54 called A3 and 
C3) that were previously shown to exhibit relatively stable var gene 
expression patterns [28]. These lines were first transfected and 
grown under 2 ug/ml blasticidin pressure, after which parasites 
were synchronized and RNA extracted from ring stages, var gene 
expression was determined by quantitative realtime RT-PCR 
using a primer set designed to specifically detect each individual var 
gene in the parasite's genome [28,31]. As expected, over- 
expression of the luciferase control (by increasing the concentra- 
tion of blasticidin in the culture media from 2 to 10 ug/mi) did not 
lead to any pronounced change in var gene expression, and the 
dominantly expressed var gene remained dominant in parasites 
grown at each drug concentration (Figure 4C). In contrast, when 
parasites were forced to over-express the PfSRIR dominant- 
negative construct, profound changes in var gene expression were 
observed, specifically leading to var gene expression switching in 
both transfected lines (Figure 4D). This phenotype was observed in 
multiple independent repeats of the experiment (Figures S3, S4, 
S5, S6). The fact that no changes were observed when luciferase 
was over-expressed at similarly high levels indicates that the effect 
was not induced by selection with blasticidin alone. 

Interestingly, in both transfected lines, a single var gene 
(PFL0030c) was strongly upregulated in response to over- 
expression of the dominant-negative construct. In A3, this gene 
became expressed at a level equal to the previously dominant 
transcript (MAL6P1.252) while in C3 it almost entirely replaced 
the previously active gene (PFD0625c). This var gene (PFL0030c 
or PFL3D7_1200600), also referred to as var2csa, is an unusual 
member of the var gene family that is under the control of a unique 
promoter type (called upsE). In addition, its transcript appears to 
be under translational control through the presence of an 
upstream open reading frame within the 5' leader of the transcript 
[51,52]. It is possible that over-expression of the PfSET2 
dominant-negative construct specifically induces expression of 
var2csa, or alternatively that this gene simply happens by chance to 
be preferentially activated in these closely related parasite 
subclones. A comprehensive set of experiments utilizing the 
generation of numerous additional transfected parasite lines 



derived from several distinct geographical isolates will be required 
to definitively address this question. 

Discussion 

The work presented here and in the recent paper by Jiang et al. 
both support a key role for PfSET2 in var gene regulation and 
antigenic variation by P. falciparum, however the details of how this 
histone mark influences var gene expression remain to be 
illuminated. In their analysis of the PfSET2 knockout lines, Jiang 
and colleagues observed that H3K36me3 is greatly reduced or 
absent at the mRNA transcription start site of the active var gene 
while this mark covers the promoter regions of silent var genes 
[22]. This led them to propose the simple hypothesis that 
H3K36me3 is a silencing mark required to keep all but a single 
member of the var gene family in a transcriptional repressed state. 
However this hypothesis does not offer an explanation for the 
additional observation that H3K36me3 is heavily present 
throughout the coding regions of all var genes, regardless of their 
transcriptional state. This differs from other histone marks 
associated with var genes (for example H3K9me3 or H3K9ac) 
which seem to more strictly mark the promoter and 5' upstream 
regions of either active or silent genes. The fact that all var genes, 
both active and silent, are marked by H3K36me3 suggests that this 
histone modification might have an additional or alternative 
function than simply being a mark for promoter silencing. One 
possibility is that H3K36me3 contributes to the recognition of 
individual var genes as members of the var gene family and 
therefore subject to mutually exclusive expression. It has 
previously been observed that when var upstream promoters are 
isolated away from the influence of an adjacent var intron, for 
example when isolated on episomes, they are no longer subject to 
mutually exclusive expression and become constitutively active 
[53-55]. This implies that they are no longer recognized as 
members of the var gene family. The regulatory activity of var 
introns has been shown to be dependent on their own promoter 
activity, a property responsible for the production of previously 
identified noncoding, or "sterile" RNAs [44,47]. The regulatory 
properties of var introns is disrupted if the sequence responsible for 
their promoter activity is deleted [56,57], however proper 
regulation is restored if the var intron is replaced by an alternative 
promoter sequence [54]. These observations suggest a model in 
which var introns exert their influence not through their specific 
DNA sequence per se, but rather through the recruitment of RNA 
pol II, which in turn recruits PfSET2 to the var locus, thereby 
resulting in the deposition of H3K36me3. Thus this model 
provides a possible mechanistic explanation for several previous 
observations regarding the role of var introns in coordinating 
expression of the var gene family. 

The recruitment of PfSET2 during the transcription of 
noncoding RNAs rather than mRNAs could also provide an 
explanation for the unusual observation that PfSET2 binds to the 
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Figure 4. Over-expression of a dominant-negative version of PfSET2 in cultured parasites. (A) Schematic diagrams showing the domain 
structures of the endogenous PfSET2 protein (top) and the truncated, dominant-negative version that contains only the N-terminal 359 amino acids 
(bottom), including the CTD-binding domain (SRIR). (B) Histograms showing RNA expression levels of the endogenous PfSET2 gene (PfSET2) and the 
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dominant-negative construct (PfSRIR) under 2 and 10 ^g/ml blasticidin. Two separate subclones of NF54 were transfected, A3 (left) and C3 (right). (C 
and D) Pie charts representing var gene expression patterns for A3 (left) and C3 (right) when over-expressing either firefly luciferase (C) or the PfSET2 
dominant-negative construct (D). The different wedges of the pie indicate the proportion of the total var mRNA pool represented by transcripts for 
each individual gene (as described in [28]). 
doi:1 0.1 371 /journal.ppat.1 003854.g004 



CTD when it is unphosphorylated. In model organisms, when 
transcribing mRNA the CTD undergoes sequential phosphoryla- 
tion at the S2 and S5 positions of the heptad repeats. While 
phosphorylation of S5 is necessary for initiation of transcription, it 
is phosphorylation at S2 that is important for choosing the proper 
site of termination as well as recruiting factors required for 
polyadenylation [58]. Interestingly, var ncRNAs terminate at a 
different position than var mRNAs transcribed from the same 
DNA template, and they do not get polyadenylated [27]. This 
supports the notion that var ncRNAs are likely transcribed by an 
RNA pol II complex in which the CTD is not typically 
phosphorylated. When these observations are taken together an 
attractive model emerges for how PfSET2 gets recruited only to 
specific regions of the genome. Only when transcribing a ncRNA 
does the CTD assume an atypically or unphosphorylated 
conformation. The recruitment of PfSET2 and the deposition of 
the H3K36me3 mark would thus be limited to these regions of the 
genome. Consistent with this hypothesis, recent genome-wide 
mapping of H3K36me3 also found this mark at telomeres [22], a 
region of the genome that has been shown to transcribe ncRNAs 
in a fashion similar to var genes [59,60]. H3K36me3 was also 
detected at some members of other multi-copy gene families 
thought to be involved in antigenic variation, however it is not 
known if ncRNAs are associated with these regions of the genome. 
Final validation of this model will require the development of 
reagents able to detect and differentiate between the different 
phosphorylated forms of RNA pol II. 

The role of RNA pol II in recruiting histone modifiers has 
become evident through studies of model eukaryotes and likely 
plays multiple roles in regulating var gene expression. Each var 
gene contains three RNA pol II promoters, one upstream of the 
coding region that is responsible for mRNA production and two 
located within the intron that result in production of both sense 
and anti-sense ncRNAs (Figure 3E). The upstream promoter is 
subject to mutually exclusive expression and thus only recruits 
RNA pol II when the gene is in the active state and is producing 
mRNA. However, previous work showed that active transcription 
from this promoter is necessary for maintenance of epigenetic 
memory [55], suggesting the possibility that in addition to 
producing mRNA, the RNA pol II complex also plays a role in 
the deposition of histone modifications when engaged at this 
promoter. The intronic promoter that drives expression of anti- 
sense transcripts similarly only recruits RNA pol II to the single, 
actively expressed gene, and it does so in ring stage parasites, at the 
same time the mRNA is transcribed. This promoter activity was 
described extensively in the recent paper by Jiang et al. [22], 
however how it influences var gene expression is not known. The 
third promoter is responsible for expression of the sense ncRNAs 
(originally called sterile KNAs) that initiate from the introns of var 
genes. These promoters recruit RNA pol II during the late 
trophozoite and schizont stages and appear to be active at all var 
genes simultaneously [47,61]. This promoter is the most likely to 
be responsible for recruiting PfSET2 to var loci, given that all var 
loci are marked by H3K36me3 regardless of whether the other 
two promoters are active or silent. Previous studies have proposed 
a "pairing" mechanism by which var introns and upstream 
promoters directly interact in a one-to-one basis [53,62], and 



DNA elements responsible for pairing were recendy identified 
[63]. Such interactions are predicted to result in the formation of a 
loop structure that would bend the intron back into close 
proximity to exon 1, thereby enabling the RNA pol II/PfSET2 
complex to deposit the H3K36me3 mark throughout both exons 
of the gene. 

Over-expression of the PfSET2 dominant-negative construct led 
to accelerated var gene expression switching in our transfected 
lines. The construct was not designed to reduce the enzymatic 
activity of the endogenous protein, but rather to interfere with its 
recruitment to the RNA pol II complex, thereby down-regulating 
its ability to modify chromatin at var genes. We hypothesize that 
H3K36me3 is involved in recognition of var genes for inclusion in 
the mutually exclusive expression pathway. If correct, down- 
regulation of PfSET2 could "loosen" this control, thereby resulting 
in accelerated switching. This model also predicts that a full knock- 
out of PfSET2 would completely disrupt mutually exclusive 
expression, leading to expression of many or all var genes 
simultaneously. This is exactiy what was observed in a recent 
study describing parasites in which PfSET2 was knocked out [22]. 
The fact that var2csa was preferentially activated in response to 
over-expression of the PfSET2 dominant-negative construct raises 
the question of whether this var gene occupies a unique position 
within the regulatory pathway that coordinates var gene expres- 
sion. In addition to being regulated at the translational level and 
having an upstream promoter region unlike other var genes, this 
gene has also been reported to display an exceptionally high "on" 
switching rate in certain subclones [64]. These authors proposed 
that high spontaneous switching rates to this gene combined with 
translational repression might constitute one component of the var 
gene regulatory pathway. Our data might provide additional 
evidence of a unique role for var2csa in controlling var gene 
expression and antigenic variation. 

The lack of SET2 in the genomes of rodent parasites is perhaps 
more puzzling, although it is consistent with a possible role for 
PfSET2 in antigenic variation, a process that has been shown to be 
regulated somewhat differently in parasites of rodents [24] . In our 
in vitro binding assays, PfSET2 binds equally well to both the 
expanded CTD of P. falciparum as well as the much shorter CTD of 
P berghei, conceivably arguing that the CTD expansion in primate 
parasites is unrelated to interactions with PfSET2. Alternatively, 
the CTD is thought to interact with several proteins simulta- 
neously during transcription, and thus an extended length could 
provide it with a greater capacity to recruit additional proteins to 
the transcriptional complex. Hence the expanded CTD could 
provide the transcriptional complex the added capacity to recruit 
PfSET2 in addition to other proteins required for its functionality. 
This property would not be evident in our in vitro binding assays, 
which utilized individual proteins. A similar interpretation can be 
made for the yeast system, in which a minimum of eight heptads 
are required for viability of the organisms, but binding of SET2 is 
evident with only three in an in vitro assay [17]. It is also worth 
noting that loss of the SET2 ortholog does not necessarily imply 
that rodent parasites do not possess H3K36me3 in their 
chromatin, but rather than this particular methyltransferase that 
binds direcdy to the unphosphorylated CTD of RNA pol II has 
been lost. Jiang et al. found that knocking out PfSet2 does not result 
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in a complete loss of H3K36me3 in P. falciparum [22], suggesting 
that an alternative enzyme exists with this methyltransferase 
activity. 

In conclusion, here we provide data to support the role of the 
histone methyltransferase PfSET2 in regulating var gene expres- 
sion and antigenic variation in P. falciparum. Perhaps more 
importandy, we also propose a mechanism for its recruitment to 
var loci through interactions with RNA pol II, potentially during 
the production of ncRNAs. These observations provide an 
additional degree of understanding for how proper chromatin 
structure is assembled at various genomic positions, and may 
provide a new paradigm for understanding how other aspects of 
parasite biology are controlled. 
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Figure SI Immunofluorescence assay (IFA) of rings and 
schizonts using anti-H3K36me3 antibodies. Localization of the 
antibody is shown in red, the parasite nuclei are stained with Dapi 
(blue). In both stages the antibody detects a single spot in which 
this histone modification appears to be localized. Similar results 
were recently reported by Jiang et al (2013). These results contrast 
with ChIP data that detect H3K36me3 at all var loci and 
telomeres, which have previously been shown to localize at 
multiple locations within the nucleus. A possible explanation is 
that in fixed cells, the H3K36me3 epitope might only be accessible 
to the antibody when it is in open chromatin, and therefore the 
IFAs only allow visualization of the single active var gene. 
Additional work will be required to determine the cause of this 
pattern. 
(PDF) 

Figure S2 The amino acid sequence of the R2 and R3 regions of 
the C-terminal domain of Rpbl from P. falciparum 3D7. The 
heptad repeats typical of Rpbl are underlined, and the serine 
residues that are sites for phosphorylation are marked with 
asterisks. 
(PDF) 

Figure S3 var gene family transcription profile from A3 cultures 
(Figure 4) shown as bar graphs. We used the var primer set to 
detect transcription as designed by Salanti et al. var gene expression 
patterns when overexpressing Firefly Luciferase at 2 ug/ml (A) 
and 10 ug/ml blasticidin (B). var gene expression profiles in the 
presence of the dominant-negative, PfSRI at 2 ug/ml (C) and 
10 ug/ml blasticidin (D). The plO primer pair detects PFL0()30c, 
or var2csa, at higher concentrations of blasticidin. 
(PDF) 

Figure S4 var gene family transcription profile for C3 cultures 
from Figure 4 also shown as bar graphs. The dominant var gene in 
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